-70- 



CLAIMS 

What is claimed is: 



1 1 . A method for branching during programmable processing in a computer 

2 graphics pipeline, comprising: 

3 (a) receiving data; 

4 (b) performing programmable operations on the data in order to generate output, 

5 wherein the operations are programmable by a user utilizing instructions 

6 from a predetermined instruction set; 

7 (c) branching between the programmable operations in a programmable manner; 

8 and 

9 (d) storing the output in memory. 

1 2. The method as recited in claim 1 , wherein the programmable operations are 

2 branched to labels. 

1 3. The method as recited in claim 2, wherein the labels are stored in a table. 

1 4. The method as recited in claim 3, wherein the programmable operations are 

2 branched to indexes in the table. 

1 5. The method as recited in claim 4, wherein each index is stored in an address 

2 register. 

1 6. The method as recited in claim 2, wherein each index is calculated. 

1 7. The method as recited in claim 1 , and further comprising terminating the 

2 programmable operations after a predetermined number of operations have 

3 been performed. 



NVIDP055/P000369 V2.0 



-71 - 



1 8. The method as recited in claim 1 , wherein the programmable operations are 

2 branched based on condition codes. 

1 9. The method as recited in claim 8, wherein the condition codes are sourced as 

2 EQ(equal), NE(not equal), LT(less), GE(greater or equal), LE(less or equal), 

3 GT(greater), FL(false), and TR(true). 

1 10. The method as recited in claim 8, wherein the condition codes are maskable. 

1 11. The method as recited in claim 8, wherein the condition codes are swizzled. 

1 12. The method as recited in claim 1 , wherein the operations are selected from 

2 the group consisting of a branch operation, a call operation, a return 

3 operation, a cosine operation, a sine operation, a floor operation, a fraction 

4 operation, a set-on-equal-to operation, a set false operation, a set-on-greater- 

5 than, a set-on-less-than-or-equal operation, a set-on-not-equal-to operation, a 

6 set true operation, a no operation, address register load, move, multiply, 

7 addition, multiply and addition, reciprocal, reciprocal square root, three 

8 component dot product, four component dot product, distance vector, 

9 minimum, maximum, set on less than, set on greater or equal than, 

10 exponential base two (2), logarithm base two (2), exponential, logarithm, 

1 1 and/or light coefficients. 

1 13. A computer program product for branching during programmable processing 

2 in a computer graphics pipeline, comprising: 

3 (a) computer code for receiving data; 

4 (b) computer code for performing programmable operations on the data in order 

5 to generate output, wherein the operations are programmable by a user 

6 utilizing instructions from a predetermined instruction set; 

7 (c) computer code for branching between the programmable operations in a 

8 programmable manner; and 
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9 


(d) 


computer code for storing the output in memory. 


1 


14. 


A system for branching during programmable vertex processing, comprising: 


2 


(a) 


a source buffer for storing data; 


3 


(b) 


a functional module coupled to the source buffer for performing 


4 




programmable operations on the data received therefrom in order to generate 


5 




output,, wherein the operations are programmable by a user utilizing 


6 




instructions from a predetermined instruction set; and 


7 


(c) 


a register coupled to the functional module for storing the output; 


8 


(d) 


wherein functional module is capable of branching between the 


9 




programmable operations. 


1 


15. 


A method for programmable processing with condition codes in a computer 


2 




graphics pipeline, comprising: 


3 


(a) 


receiving data; 


4 


(b) 


performing programmable operations on the data in order to generate output, 


5 




wherein the operations are programmable by a user utilizing instructions 


6 




from a predetermined instruction set including condition codes; and 


7 


(c) 


storing the output in memory. 


1 


16. 


The method as recited in claim 15, wherein the condition codes are utilized 


2 




to move between the programmable operations. 


1 


17. 


The method as recited in claim 15, wherein the condition codes are utilized 


2 




to control write masks. 


1 


18. 


The method as recited in claim 17, wherein the write masks are controlled 


2 




utilizing an AND operation. 


1 


19. 


A computer program product for programmable processing with condition 


2 




codes in a computer graphics pipeline, comprising: 
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3 (a) computer code for receiving data; 

4 (b) computer code for performing programmable operations on the data in order 

5 to generate output, wherein the operations are programmable by a user 

6 utilizing instructions from a predetermined instruction set including 

7 condition codes; and 

8 (c) computer code for storing the output in memory. 

1 20. A system for programmable vertex processing with condition codes, 

2 comprising: 

3 (a) a source buffer for storing data; 

4 (b) a functional module coupled to the source buffer for performing 

5 programmable operations on the data received therefrom in order to generate 

6 output, wherein the operations are programmable by a user utilizing 

7 instructions from a predetermined instruction set including condition codes; 

8 and 

9 (c) a register coupled to the functional module for storing the output. 

1 21. A method for programmable processing in a computer graphics pipeline 

2 utilizing a predetermined instruction set, comprising: 

3 (a) receiving data from a source buffer; 

4 (b) performing programmable operations on the data in order to generate output, 

5 wherein the operations are programmable by a user utilizing instructions 

6 from a predetermined instruction set; and 

7 (c) storing the output in a register; 

8 (d) wherein the operations include at least ten (10) operations selected from the 

9 group consisting of a branch operation, a call operation, a return operation, a 

10 cosine operation, a sine operation, a floor operation, a fraction operation, a 

1 1 set-on-equal-to operation, a set false operation, a set-on-greater-than, a set- 

12 on-less-than-or-equal operation, a set-on-not-equal-to operation, a set true 

13 operation, a no operation, address register load, move, multiply, addition, 

14 multiply and addition, reciprocal, reciprocal square root, three component dot 
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1 5 product, four component dot product, distance vector, minimum, maximum, 

16 set on less than, set on greater or equal than, exponential base two (2), 

17 logarithm base two (2), exponential, logarithm, and/or light coefficients. 

1 22. A method for executing a function in a computer graphics pipeline, 

2 comprising: 

3 (a) receiving input data in a computer graphics pipeline; 

4 (b) directly performing a mathematical function on the input data in order to 

5 generate output data, wherein the mathematical function is directly 

6 performed in the computer graphics pipeline; and 

Ail 

j|| 7 (c) storing the output data in memory on the computer graphics pipeline. 

o 

Fj 1 23. The method as recited in claim 22, wherein the mathematical function is 

O 2 selected from the group consisting of sine, cosine, exponent base two (2), and 



3 logarithm base two (2). 

1 24. The method as recited in claim 22, wherein the mathematical function is 

2 selected from the group consisting of sine, cosine, tangent, arctangent, 

3 exponent, logarithm, antilogarithm, hyperbolic sine, hyperbolic cosine, 

4 hyperbolic tangent, and hyperbolic arctangent. 

1 25 . The method as recited in claim 22, wherein the mathematical function is a 

2 function in which an initial n derivatives are capable of being tabulated and 

3 accessed via an interpolation operation. 

1 26. The method as recited in claim 22, wherein the input data is in a floating- 

2 point format. 

1 27. The method as recited in claim 22, wherein the mathematical function is 

2 performed utilizing a Taylor Series. 
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1 28. The method as recited in claim 22, wherein the mathematical function is 

2 performed utilizing a cordic algorithm. 

1 29. The method as recited in claim 22, and further comprising converting the 

2 input data from a first coordinate system to a second coordinate system. 

1 30. The method as recited in claim 22, wherein the mathematical function is 

2 carried out in one cycle in the computer graphics pipeline. 

1 31. A computer program product for executing a function in a computer graphics 

2 pipeline, comprising: 

3 (a) computer code for receiving input data in a computer graphics pipeline; 

4 (b) computer code for directly performing a mathematical function on the input 

5 data in order to generate output data, wherein the mathematical function is 

6 directly performed in the computer graphics pipeline; and 

7 (c) computer code for storing the output data in memory on the computer 

8 graphics pipeline. 

1 32. A system for executing a function in a computer graphics pipeline, 

2 comprising: 

3 (a) a computer graphics pipeline for receiving input data, directly performing a 

4 mathematical function on the input data in order to generate output data, and 

5 storing the output data in memory; 

6 (b) wherein the mathematical function is directly performed in the computer 

7 graphics pipeline. 

1 33. A system for executing a function in a computer graphics pipeline, 

2 comprising: 

3 (a) means for receiving input data in a computer graphics pipeline; 
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4 (b) means for directly performing a mathematical function on the input data in 

5 order to generate output data, wherein the mathematical function is directly 

6 performed in the computer graphics pipeline; and 

7 (c) means for storing the output data in memory on the computer graphics 

8 pipeline. 

1 34. A method for executing a function, comprising: 

2 (a) receiving input data; 

3 (b) identifying the function to be executed on the input data; 

4 (c) pre-processing the input data based on the function to be executed on the 

5 input data; 

6 (d) processing the input data utilizing a plurality of operations independent of the 

7 function to be executed on the input data; 

8 (e) post-processing the input data to generate output data; and 

9 (f) storing the output data in memory. 

1 35. The method as recited in claim 34, wherein the pre-processing includes 

2 adding a one (1) to the phase of the input data if the function to be executed 

3 on the input data is cosine. 

1 36. The method as recited in claim 34, wherein the pre-processing includes 

2 multiplying the input data by (l/(2ir) + 1) if the function to be executed on 

3 the input data is at least one of sine and cosine. 

1 37. The method as recited in claim 34, wherein the pre-processing includes 

2 performing a conditional 1 's complement operation on the input data if the 

3 function to be executed on the input data is at least one of sine or cosine. 

1 38. The method as recited in claim 34, wherein the pre-processing includes 

2 performing a barrel shift operation on the input data if the function to be 

3 executed on the input data is the exponent operation. 
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1 39. The method as recited in claim 34, wherein the processing includes 

2 extracting a set of most significant bits and a set of least significant bits from 

3 a mantissa associated with the input data. 

1 40. The method as recited in claim 34, wherein the processing further includes 

2 conditionally adding a one (1) to the most significant bits. 

1 41. The method as recited in claim 34, wherein the processing includes 

2 calculating a Taylor Series. 

1 42. The method as recited in claim 41 , wherein the processing includes looking 

2 up information in a plurality of tables. 

1 43. The method as recited in claim 42, wherein the tables are utilized based on 

2 the function to be executed on the input data. 

1 44. The method as recited in claim 42, wherein the information includes a 

2 plurality of derivatives. 

1 45 . The method as recited in claim 43, wherein the tables are hard-coded. 

1 46. The method as recited in claim 43, wherein the tables are programmable. 

1 47. The method as recited in claim 43, wherein the tables are loaded at runtime. 

1 48. The method as recited in claim 44, wherein the derivatives are summed. 

1 49. The method as recited in claim 34, wherein (b)-(f) are carried out in one 

2 cycle. 
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1 50. The method as recited in claim 34, wherein the function includes at least one 

2 of sin(x) and cos(x), where x is in at least one of degrees and radians. 

1 51. A computer program product for executing a function, comprising: 

2 (a) computer code for receiving input data; 

3 (b) computer code for identifying the function to be executed on the input data; 

4 (c) computer code for pre-processing the input data based on the function to be 

5 executed on the input data; 

6 (d) computer code for processing the input data utilizing a plurality of operations 

7 independent of the function to be executed on the input data; 

8 (e) computer code for post-processing the input data to generate output data; and 

9 (f) computer code for storing the output data in memory. 

1 52. A system for executing a function, comprising: 

2 (a) logic for receiving input data; 

3 (b) logic for identifying the function to be executed on the input data; 

4 (c) logic for pre-processing the input data based on the function to be executed 

5 on the input data; 

6 (d) logic for processing the input data utilizing a plurality of operations 

7 independent of the function to be executed on the input data; 

8 (e) logic for post-processing the input data to generate output data; and 

9 (f) logic for storing the output data in memory. 

1 53 . A method for executing a function in a computer graphics pipeline, 

2 comprising: 

3 (a) receiving input data in a computer graphics pipeline; 

4 (b) identifying a sign, an exponent, and a mantissa associated with the input data 

5 utilizing the computer graphics pipeline; 

6 (c) normalizing the input data utilizing the computer graphics pipeline; 

7 (d) if the function includes an exponent function, executing a barrel shift 

8 operation on the input data utilizing the computer graphics pipeline; 
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9 (e) if the function includes a cosine function, adding a one (1) to the phase of the 

1 0 input data utilizing the computer graphics pipeline; 

11 (f) if the function includes at least one of a sine function and a cosine function, 

1 2 multiplying the input data by ( 1 /(2t) + 1 ) and performing a conditional 1 ' s 

1 3 complement operation on the input data utilizing the computer graphics 

14 pipeline; 

1 5 (g) extracting a set of most significant bits and a set of least significant bits from 

16 the mantissa associated with the input data utilizing the computer graphics 

17 pipeline; 

1 8 (h) adding a one (1) to the most significant bits utilizing the computer graphics 

19 pipeline; 

20 (i) looking up a plurality of derivatives based on the most significant bits 

2 1 utilizing the computer graphics pipeline; 

22 (j) calculating a Taylor Series with the derivatives and the least significant bits 

23 utilizing the computer graphics pipeline; 

24 (k) post-processing the input data to generate output data utilizing the computer 

25 graphics pipeline; 

26 (1) storing the output data in memory on the computer graphics pipeline. 
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